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Abstract. We prove sharp limit theorems on random walks on 
graphs with values in finite groups. We then apply these re- 
sults (together with some elementary algebraic geometry number 
theory and representation theory) to finite quotients of lattices 
f-H ' in semisimple Lie groups (specifically SL(n, Z) and Sp(2«, Z)) to 

^ | . show that a "random" element in one of these lattices has irre- 

ducible characteristic polynomials (over Z). The term "random" 
can be defined in at least two ways (in terms of height and also in 
terms of word length in terms of a generating set) - we show the 
result using both definitions. 

We use the above results to show that a random (in terms of 
word length) element of the mapping class group of a surface is 
J> ' pseudo-Anosov, and that a a random free group automorphism is 

CN . irreducible with irreducible powers (or strongly irreducible^). 
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Introduction 



^ ■ This paper was inspired by the following question, first brought to 

the author's attention by Ilya Kapovich: 

Is it true that a random element of the mapping class 
group of a surface is pseudo-Anosov? 

^ ■ The definition of random in the question above is not explicitely 

given, but a reasonable way to define is to fix a generating set of the 
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mapping class group, and look at all the words of bounded lengthQ 
Kapovich had suggested that a reasonable way to attach this question 
was to study the action of the mapping class group Ai g on homology 
which gives a symplectic representation of Ai g . Results of Casson and 
Bleiler (see [4]) then give a set of sufficient conditions for a surface 
automorphism to be pseudo-Anosov in terms of its image under that 
representation. 

This paper, then, is the embodiment of the program as described 
above. Along the way, we show a number of results not explicitely 
related to low dimensional topology or geometric group theory. 

Here is a summary of the results: 

In my old preprint lli~9l I state, and sketch the proof of a general 
equidistribution theorem for products of elements of finite (and, more 
generally, compact) groups along long paths in finite graphs. In the 
current paper I give complete arguments. In the follow up-paper lli~8l 
much sharper results are given (with speed of convergence bounds). 
In particular, the convergence bounds are uniform for families of finite 
quotients of a group satisfying Lubotzky's property t (or the stronger 
Kazhdan's property T). 

We then show that the set of polynomials reducible over Z with 
constant coefficient 1 is an algebraic subvariety of the set of polynomi- 
als, and using elementary theory of algebraic groups show that the set 
of matrices in SL(n, R) (where R is the coefficient ring) with reducible 
characteristic polynomials is a finite union of Zarisky-closed sets, so 
in particular, when R = Z p , the proportion of matrices in SL(n,R) 
with reducible characteristic polynomials decreases as 0(1 /p), with 
constants independent of p (Section|5]). Using these methods together 
with Weil's estimate for the number of points on a curve over F p we 
show that the probability that the characteristic polynomial of a ma- 
trix in SL(n, Z p ) has the full symmetric group S„ as Galois group goes 
is bounded (uniformly) away from 0. Along the way, we show that 
the probability that a polynomial with fixed constant term 1 has a 
prescribed splitting type T modulo p is the same as the probability 
that a random polynomial has splitting type T (for large p). (Section 
ED). 

We quote results of Borel (proofs can be found in Nick Chavdarov's 
thesis [5]) which show that the probability that a matrix in Sp(2n, Z p ) 



Another way is to look at a combinatorial ball of radius N around identity; it 
turns out that our results, together with the results announced recently by Ursula 
Hammenstadt answer both questions. 
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has reducible characteristic polynomial is bounded away from 1 (uni- 
formly in p.) (Section [6]) 

Combining these results with with the results on walks on graphs, 
together with elementary local-to-global estimates, we show that for 
a given undirected graph G whose vertices are decorated with a sym- 
metric generating set of T (where T is either SL(n, Z) or Sp(2n, Z)), the 
probability that the product of generators along a random long walk 
of length N gives a matrix with reducible characteristic polynomial 
(or, in the case of SL(n, Z), a polynomial with non-generic Galois 
group) goes to as N goes to infinity. 

From a number-theoretic standpoint, using the length of the rep- 
resenting word in generators is a less natural way to define the size 
of an integral matrix than a(ny) matrix norm. It turns out that with 
that definition of size the result follows by combining our results on 
finite matrix groups with the work of |7j, and effective bounds can 
be obtained using the uniformity results of [16j (see Theorem |5.2|) . 

Finally, we apply the above-mentioned results to automorphisms 
of surfaces (Section |8]) and free group automorphisms (Section |9). 
For surface automorphisms we use the observations of Casson and 
Bleiler (01) to show that for an arbitrary generating set, most prod- 
ucts of up to N generators are pseudo-Anosov, and if the generating 
set happens to be symmetric, then the fraction of the non-pseudo- 
Anosov products goes to zero exponentially fast with N. It can be 
argued that it is more natural to consider all elements in a combi- 
natorial ball of radius N. Since our limit theorems for graphs are for 
undirected graphs, such a result will follow (for certain generating 
sets) once we know that the mapping class groups are bi-automatic. 

Similarly, it is shown that most words in a generating set of the 
outer automorphism group of a free group F n are irreducible with 
irreducible powers, or what Lee Mosher calls strongly irreducible. 

1. Generalities on algebraic geometry and algebraic groups 
First, recall the following: 

Definition 1.1 (iQlMp. 23). Let V c k n , and W c k m be non-empty 
algebraic sets. We say that <p : V —> W is a regular map, if there exist 
fx, ...,f m e k[Xi, X n ], (where X\, . . . , X n are indeterminantes) such 
that 

<t>(x) = (/l(4...,/raW) 

for all x G V. 
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Example 1.2. Let V = SL(n, k) c k" 2 , and let W = k n . Then the map as- 
sociating to each matrix the coefficients its characteristic polynomial 
is a regular map. 

Consider the following setup: we have a parametrized set S in k n 
(for k an algebraically closed field), that is: 

%1 = fl{S\r ■ ■ -fSm), 
x 2 = fl{S\, ■ • ■ , S m ), 

Xfi — fn{S\, ■ ■ ■ , Sin), 

where f\,...,f n are polynomials in S\, . . . ,s m . By the "Implicitization 
Theorem" ||6]] [Chapter 3], the Zariski closure of S is an affine vari- 
ety, whose dimension is bounded above by m (by considering the 
dimension of the tangent space, see [lO][Theorem 1.4.11]). 

1.1. Fibers of dominant morphisms. The following results are stan- 
dard; the statement is taken from [10]|[pp. 116,118]. First, let A[X] be 
the algebra of regular functions on X, and second, for g e A[X], let 
X f = {x G X\f(x) * 0}. Now 

Theorem 1.3. Let <p : X — > Y be a dominant morphism of irreducible 
affine varieties. In particular, <p* : A[Y] — > A[X] is injective, and d = 
dimX - dim Y > 0. Then we have a factorization 

<P\x r(s) ■ X ng) -^Y g xk* -*_> Y g , 

where <p is a finite dominant morphism, and p\ is the projection on the first 
factor. 

Corollary 1.4. Let a : X — > Y be a dominant morphism of irreducible 
varieties, and put r = dim X - dim Y. Then, there exists a non-empty open 
set U c Y, such that U Q (p(X), and such that dim (/) _1 (y) = rfor all y e U. 

Remark 1.5. U can be taken to be of the form Xf, for some / £ A[Y]. 

A dominant morphism a is a morphism such that the preimage of 
any open dense set in the image variety is dense. In particular, a 
surjective regular map is dominant. 

Example 1.6. Consider V c k n , where V = SL(n, k). The map x> which 
associates to each matrix the coefficients of its characteristic polyno- 
mial is a dominant morphism onto k n ~ x (by the companion matrix 
construction, it is surjective). By the results (and notation) above, 
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there is a g £ A[V\, such that the fibers of x restricted to V g are 
n 2 - n-dimensional varieties. 

Similarly, let Sp(2n, k) = W c A: 4 " 2 , and let x' be the map which asso- 
ciates to each matrix m the coefficients of x, . . . , x n of the characteristic 
polynomial of m. By Theorem IA.lt x' is a dominant morphism, and 
by Fact 13.31 and the results of this section, there is an h 6 A[W], such 
that the fibers of x' restricted to W/, are 2n 2 -dimensional varieties. 

2. Counting points on varieties 

Let S be a variety o f dimension m over C (or Q. Consider a reduc- 
tion of S modulo p. 

Theorem 2.1 (Lang-Weil, |15j). The number of F p points on S grows 
as 0(p m ). The implied constant is uniform (that is, it is a function of the 
dimension and codimension of the variety only). 

It should be noted that we are using this theorem for the upper 
bound only, and the upper bound is an easy result (see ||IuT Lemma 
4.1.3]), unlike the full Lang-Weil result which gives both upper and 
lower bounds. Lang and Weil is deduced from: A. Weil's estimate 
on the number of F„ points on a curve defined over F p : 

Theorem 2.2 (A. WeiL fl23| ). Let f e W p [X, y] be an absolutely irreducible 
(that is, irreducible in F p [X, Y]) polynomial of degree d. Then if 

C = {(x,y)eW 2 p \f(x,y) = 0}, 

we have the estimate 

\\C\-p\<2g^ + d 2 , 

where g is the genus of the curve defined by f (which satisfies g < (d - 
l)(d - 2). This estimate is sharp. 

Theorem 12.11 together with the results of Section ll~T1 imply: 

Theorem 2.3. With the situation as described in Corollary \lA\ if the ground 
field is F p , if S Q Y,then |a _1 (S)| < Ci\S\p r + C2(p dimX ~ 1 ) where the constants 
C\, c 2 depend only on the dimensions ofX, Y. 

3. Classical groups 

In this paper we will be primarily concerned with the special linear 
group and the symplectic group over various domains. We will need 
the following facts: 
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Fact 3.1 (See (2l OH). The groups GL(n, k), SL(n, k) and Sp(n, k) over an 
algebraically closed field k are irreducible as algebraic varieties. This is 
equivalent to saying that the groups in question are connected. 

Fact 3.2. The dimension o/SL(n, k) equals n 2 -l. The dimension o/Sp(2n, k) 
equals 2n 2 + n. In both cases, dimension is meant in the sense of algebraic 
geometry. 

LetF p = Z/pZ. 

The following goes back to Dickson: 
Fact 3.3. The order ofSL(n, F p ) equals 

p ( " 2 - n)/ V - l)(p 3 - 1) . . . (p n - 1) = p" 2 - 1 + 0(p" 2 - 3 ). 
The order of GL(n, F p ) equals 

p(" 2 -«)/2( p _ 1)(p 2 _ 1)(p3 _ i) . . . ( p « _ i) = p» 2 + Oip" 2 - 1 ). 
The order o/Sp(2n, F p ) equals 

p" 2 (p 2 - l)(p 4 - 1) . . . (p 2n - 1) = p 2 " 2+n + 0(p 2 " 2+ "" 2 ). 

4. Applications to polynomials 

Let PdiKfi) be the set of all monic polynomials in one variable of 
degree d over a field k with constant coefficient jS, and let jS, a) c 
PdiKfi) be the set of those polynomials which have a polynomial 
factor (over k) of degree m with constant term a. Let us identify 
PdiKP) with the affine space k d_1 . Then, we have the following: 

Theorem 4.1. The set P'2(k,p,a) is contained in an affine hyper surface of 
Proof. Let 

d-i 

p(x) = x d + V fljX' G F. 

By assumption, p{x) = ^(x)r(x). Assume that the degree of q{x) = m, 
while the constant term of q(x) equals a. Writing 

m-l 

q(x) = x m + V fcyX 7 , 

and 

rf-m-l 

r(x) = x rf_m + a /« + c k x k , 

k=l 
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we see that P"}(k,p,a) is a polynomially parametrized hypersurface 
of?> d (jU). ' □ 

Corollary 4.2. The set of all reducible monic -polynomials with fixed con- 
stant term ft is the union 

d 

U *> 

m=l 

ofaffine hypersurfaces ofPd(k,f$) 

Theorem 4.3. Lef P(d)(F p ) fre f/ze set of polynomials of degree d with coef- 
ficieded in absolute value by N and constant coefficient 1, and let Ri(d)(W p ) 
be the set of polynomials reducible over F p with some factor having constant 
term equal to 1 mod p. Then, R\(d) lies on an algebraic hypersurface C d_1 
(where the coordinates are the coefficients), and consequently 

Proof. Immediate from Theorem 12. 11 □ 

Finally, one definition: 

Definition 4.4. We say that a polynomial p(x) e P\{d) is reciprocal if 
x d p(l/x) = p(x) - in other words, the list of coefficients ofp is the same read 
from left to right as from right to left. Reciprocal polynomials can also be 
defined as follows: A (monic) polynomial (of even degree In) is reciprocal if 
it can be written as 

n n 

\[(x - n)(x - l/n) = Y\(x 2 - (n + r. l )x + 1). 

5. Applications to matrices 
5.1. The special linear group. 

Lemma 5.1. The probability that the characteristic polynomial of a matrix 
M in SL(n,F p ) has a factor with constant term 1 is of order 0(l/p). 

Proof. This follows immediately from Example II .61 and Theorems 14.31 
and|Z3] ' □ 

Unfortunately, since the number of integral points on SL(n, Z) of 
height (absolute value) bounded by B grows much slower than B" 2 " 1 
the above results do not imply the following 
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Theorem 5.2. The probability that a matrix in SL(ft,Z) with coefficients 
bounded by B has reducible characteristic polynomial goes toOasB goes to 
infinity. 

Proof. For the "soft" result as above we use results of Duke Rudnick 
and Sarnak ([7]) or of Eskin and McMullen ([8]). These imply that 
the matrices in SL(n, Z) with coefficients bounded by B are asymp- 
totically equidistributed among the cosets of any subgroup of finite 
index, in particular the cosets of the principal congruence subgroup 
T p (n, Z). Since reducibility modulo p depends only on the coset mod- 
ulo T p , Lemma |5T] immediately replies the result. □ 

Remark 5.3. To get an effective result, we need to know that the error 
terms in equidistribution modulo T p are uniform (do not depend on 
p.). Precisely such a result is shown in the preprint of Amos Nevo 



and Peter Sarnak [16J. 



6. Random products of matrices in the symplectic and special 

linear groups 

In the preceeding section we defined the size of a matrix by (in 
essence) its L 1 norm (any other Banach norm will give the same 
results). However, it is sometimes more natural to measure size 
differently: In particular, if we have a generating set y\,...,yi of our 
lattice T (which might be SL(n, Z) or Sp(2n, Z)) we might want to 
measure the size of an element by the length of the (shortest) word 
in y, equal to that element - this is the combinatorial measure of size. 
The relationship between the size of elements and combinatorial 
length is not at all clear, so the results in this section are proved quite 
differently from the results in the preceding section. 

We will be using Theorems II 1.21 and II 1.31 

Remark 6.1. We will be applying Theorem lll.2l to groups SL(ft, Z/pZ) 
and Sp(2n, Z/pZ). Since those groups have no non-trivial one-dimensional 
representations, the assumption on p in the statement of the theorem 
is vacuous. 

We will also need the following results of Nick Chavdarov and 
Armand Borel. 

Theorem 6.2 (N. Chavdarov, A. Borel |5]|). Let q > 4, and let R q {n) 
be the set of In x 2n symplectic matrices over the field T q with reducible 
characteristic polynomials. Then 

Wl ; 1 i 

|Sp(2n,F p )| 3n' 
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Theorem 6.3 (N. Chavdarov, A. Borel [5]). Let q > 4, and let G q (n) be 
the set of n x n matrices with determinant y + over the field F q with 
reducible characteristic polynomials. Then 

\G q {n)\ ^ i 1 

\SL(n,F q )\ < In 

Theorem 16.31 follows easily from the following result of A. Borel: 

Theorem 6.4 (A. Borel). Let F be a monk polynomial of degree N over 
Z/pZ with nonzero constant term. Then, the number #F,p of matrices in 
GL(N, p) with characteristic polynomial equal to F satisfies 

(p - 3f 2 ~ N < #(F,p) <(p + 3f 2 ~ N . 

Theorem 16.41 will be used in Section 
We now have our results: 

Theorem 6.5. Let G and Sn, be as in the statement of Theorem 111.21 but 
with T = Sp(2n, Z), or T = SL(2, Z). Then the probability that a matrix in 
S N has a reducible characteristic polynomial goes toOasN tends to infinity. 

Proof. Let T/ be the set of matrices in T reduced modulo / - it is known 
(see |fl7|) that Ti is SL(n, /) or Sp(2n, I) (depending on which T we took. 
Let pi,...,pk be distinct primes, let K = pi . . . p^. We know that: 

Lk = T pi x • • • x T pk . 

(see [TZj for the proof of the last equality). A generating set of 
Sp(2n, Z) projects via reduction modulo K to a a generating set of 
T K (see, again, Newman's book |T7||), and also, via reduction mod p ( 
to generating sets of the Sp(2n, pi). By Theorems lll.2lll.3l and l6~2l the 
probability that the characteristic polynomial in a random product 
of N » 1 generators is reducible modulo all of the p, is at most equal 
to (1 - 3 Inf. Since this is an upper bound on the probability of being 
reducible modulo Z, the result follows. □ 

Remark 6.6. Using Lemma I57T1 instead of Theorem 16.31 for SL(n,Z) 
gives a sharper result, as well as a more elementary argument. 

Remark 6.7. The proof of Theorem |5.2| translates mutatis mutandis into 
this setting - instead of the principal congruence subgroup T p for a 
single prime p, we use it for q = p x X • • • X p^. 

An example of a graph G is a graph where every vertex is connected 
to every vertex (including itself), and the set of label is a symmetric 
generating set. In this case, we are just taking random products of 
generators or their inverses. Another is the graph (studied in [19]]) 
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where a vertex labelled by a generator a is connected to every vertex 
except the one labelled by a" 1 , so that only reduced words in the 
generators are allowed, and so on. 

7. Stronger irreducibility 

We might ask if something stronger than irreducibility of the char- 
acteristic polynomial can be shown.The answer is in the affirmative. 
Indeed, the methods of the preceeding sections combined with the 
results of the Appendix give immediately: 

Theorem 7.1. The probability that a random word of length L in a gener- 
ating set of SL(N, Z) has characteristic polynomial with Galois group S N 
goes to 1 as L goes to infinity. 

Aside from its intrinsic interest, Theorem l7.1l implies the following: 

Theorem 7.2. The probability that a random word w of length L in a gener- 
ating set o/SL(n, Z) and all proper powers w k have irreducible characteristic 
polynomials goes to 1 ash goes to infinity. 

Theorem 17.21 will follow easily from Theorem 17. 1 1 together with the 
following LemmaB 

Lemma 7.3. Let M e SL(n,Z) be such that the characteristic polynomial 
ofM k is reducible for some k. Then the Galois group of the characteristic 
polynomial of M is imprimitive, or the characteristic polynomial ofM is 
cyclotomic. 

Remark 7 A. For the definition of imprimitive see, for example, |24l[TT| . 

Proof. Assume that the characteristic polynomial x(M) is irreducible 
(otherwise the conclusion of the Lemma obviously holds, since the 
Galois group of x(M) is not even transitive). Let the roots of x(M) 
(in the algebraic closure of Q) be a\,...,a n . The roots of x(M k ) are 
jSi, . . . , j6,„ where /3 ; = a k . Suppose that x(M ,c ) is reducible, and so 

there is a factor of x(M k ) whose roots are /Si, ...,/?/, for some I < n. 
Since Gal(x(M)) acts transitively on a\, . . . , a n , it must be true that for 
every i e {1, . . . , n\, a k = fij, for some / e {1, . . . , /}. Let B ; be those i for 
which a k = (5,. This defines a partition of {1, ... , n] into blocks, which 
is stabilized by the Galois group of x(M), and so G is an intransitive 
subgroup of S n , unless I = 1. In that case, the characteristic polynomial 
of M k equals (x-p) n , and since M k e SL(n, Z) it follows that = 1, and 
all the eigenvalues of M are n-th roots of unity, so that M k = 1. □ 



2 Compare with |5] [Lemma 5.3] 
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8. The mapping class group 

Let S g be a closed surface of genus g > 1, and let T g be the map- 
ping class group of S g . The group T g admits a homomorphism s 
onto Sp(2g, Z) (we associate to each element its action on homology; 
the symplectic structure comes from the intersection pairing). The 
following result can be found in ||4|: 

Theorem 8.1. For y &T g to be pseudo-Anosov, it is sufficient that g = y 
satisfy all of the following conditions: 

(1) The characteristic polynomial of g is irreducible. 

(2) The characteristic polynomial of g is not cyclotomic. 

(3) The characteristic polynomial of g is not of the form g = h(x k ),for 
some k > 1. 

The following is a corollary of our results on matrix groups: 

Theorem 8.2. Let gi,...,g k bea generating set o/Sp(2n, Z). The probabil- 
ity that a random product of length N of gi,-.-,gk satisfies the conditions 
of Theorem \8.1\ goes to 1 asN goes to infinity. 

Proof. We prove that the probability that the random word % not 
satisfy the conditions goes to 0. By Theorem 16.51 the probability that 
Wn has reducible characteristic polynomial goes to 0. In order for the 
characteristic polynomial to be of the form g = h(x k ) it is necessary 
that the linear term (the trace) vanish. The set of traceless matrices 
is a proper subvariety of Sp(2g), so by Theorem l2.il for a large p, the 
probability that a given matrix is traceless is «: 1 jp, and in particular 
is bounded away from 1. The proof of Theorem |6.5| now goes through 
verbatim to show that the set of traceless matrices is asymptotically 
negligible in Sp(2g, Z). 

Finally the set of cyclotomic polynomials of a given degree 2g is 
bounded by some function h(g). □ 

9. Free Group Automorphisms 

An automorphism of <p of a free group F n is called strongly irre- 
ducibl^ if no (positive) power of <p sends a free factor H of F„ to a 
conjugate. This concept was introduced by M. Bestvina and M. Han- 
del |H1, and many of the results of the theory of automorphisms of 
free groups are shown for such automorphisms (for a survey, and 
the relationship between strongly irreducible automorphisms and 

This terminology, with strong support from this author, has been introduced 
by L. Mosher and M. Handel for what was previously known as irreducible with 
irreducible powers 
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pseudo-Anosov automorphisms of surfaces, the reader is urged to 
read the very clear survey3l3j. By passing to the action of (p on the 
abelianization of F„ (equivalently on Hi(F n , Z)), Section^ shows the 
following: 

Theorem 9.1. Let fx, . . . , fa be a generating set of the automorphism group 
of F n . Consider all words of length L in ft,..., fa. Then, for any n, the 
probability that such a word is irreducible tends to 1 as L tends to infinity 
and also the probability that such a word is strongly irreducible tends to 1 
as L tends to infinity. 

10. Galois groups of generic restricted polynomials 

Let Pjv,rf(Z) be the set of monic polynomials of degree d with integral 
coefficients bounded by N in absolute value. It is a classical result 
of B. L. van der Waerden that the probability that the Galois group 
of p G P N/d (Z) is the full symmetric group Sd tends to 1 as N tends 
to infinity. The argument is quite elegant: First, it is observed that 
a subgroup H < Sd is the full symmetric group if and only if H 
intersects every conjugacy class of Sd. This means that H has an 
element with every possible cycle type. It is further noted that there 
is a cycle type (tlx, . . . , n k ) in the Galois group of p over Z/pZ if and 
only if p factors over Z/pZ into irreducible polynomials of degrees 
tlx,..., n k- Using Dedekind's generating function for the number of 
irreducible polynomials over Z/pZ of a given degree, it is shown 
that the probability of a fixed partition is is bounded below by a 
constant (independent of the prime p), and the proof is finished by 
an application of a Chinese Remainder Theorem. 

In this note, we ask the following simple-sounding question: Let 
Pn,cWc(Z) be the set of all polynomials in P N/C ?(Z) where the coefficient 
of x k equals a. Is it still true that the Galois group of a random such 
polynomial is the full symmetric group? The result would obviously 
follow if the probability that the Galois group of a random general 
polynomial is "generic" were to go to 1 sufficiently fast with N. In 
fact, the probability that an element of P^ is reducible (which means 
that its Galois group is not transitive, hence not S n ) is of the order of 
1 /N, so that approach does not work. 

Mimicking the proof of van der Waerden's result does not appear 
to work (at least not easily): Dedekind's argument enumerates all 

4 to appear, but available at 
http://www.math.cornell.edu~ vogtmann/papers/AutQuestions/Questions.html 
s We need to change SL(«, Z) to GL(«, Z) throughout 
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irreducible polynomials, and the result is not "graded" by specific 
coefficients. It is certainly possible that the argument can be pushed 
through, but this appears to be somewhat involved. 

Given this sad state of affairs, we first use a simple trick and Dirich- 
let's theorem on primes on arithmetic progressions to show first the 
following technical result: 

Theorem 10.1. The probability that a random element of P^^Z/pZ) 
has a a prescribed splitting type s approachs the probability that a random 
unrestricted polynomial of degree d has the splitting type s, as long asp -I 
is relatively prime to (d - k)\, and as p becomes large. 

which implies (by van der Waerden's argument): 

Theorem 10.2. The probability that a random element of P Nr d ia ,k(Z) has Sd 
as the Galois group tends tol asN tends to infinity, 

It should be noted that the (multivariate) Large Sieve (as used by 
P. X. Gallagher in 10) can be used to give an effective estimate on 
the probability in the statement of Theorems 110.21 that is: p(N) «. 
AT 1/2 logN. 

10.1. Proof of Theorem 110.21 We will need two ingredients other 
than van der Waerden's original idea. The first is A. Weil's estimate 
(Theorem I2.2|) , the second is the following classical result: 

Theorem 10.3 ([14J[Theorem VIII.9.1). ] Let k be afield, and n > 2 an 
integer. Let a e k, a + 0. Assume that for all prime numbers p such that 
p\n we have a <g. kP , and if4\n, then a g -4k 4 . Then X" - a is irreducible in 
k[X]. 

Theorem 110.31 goes essentially back to N. H. Abel's foundational 
memoir. 

We will need an additional observation: 

Lemma 10.4. Let q = p l , and let Xi,...,Xk e F 9 . Let a,b £ F p , with 
(a, b) + (1, 0) and let g(a, b)(x) = ax + bbea transformation of¥ q to itself. 
Then, it is not possible for g(a, b) to permute x x , ...,x^, ifk\ is coprime to 
p-l. 

Lemma 10.5. Consider a polynomial f of degree d over F„, such that d < p, 
and such that the coefficient of x d ~ x does not vanish. Then there is no pair 
{a, b) ± (1, 0), such that f(ax + b) = a d f(x),for all x e F p . 

Proof. There are two distinct cases to analyze. The first is when a = 1. 
In that case, f(x+b) = /(x) for all x £ F p/ and since p > d,f(x+b) = f(x), 
for all x in the algebraic closure of F p . Let r be a root of /. Then, so are 
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r + b,r + lb, . . . ,r + b(p — 1), which are all distinct as long as b ± 0, but 
since p is greater than d that means that / is identically 0. 

The second case is when a £ 1. In that case, x = b/(l - a) is 
fixed under the substitution x — > ax + b, so setting z = x — Xq, we 
see that f(ax + b) = f(az), and so f(ax + b) = a d f(x) implies that 
f(az) = a d f(z), for all z £ F p . Since p > deg/, the corresponding 
coefficients of the right and the left hand polynomials must be equal 
Since the coefficient of x d ~ l does not vanish, it follows tha a = 1, 
which contradicts our assumption. □ 

The argument now proceeds as follows. First, we note that if the 
polynomial f(x) of degree d has a certain splitting type (hence Galois 
group) over F p then so does f(ax + b)/a d , for any a ± 0, b £ F p . The set 
of all linear substitutions forms a group A, which acts freely on the 
set of polynomials of degree d, except for the (small) exceptional set 
of polynomials with a vanishing coefficient of x d ~ l as long as d < p 
(by Lemma |10.5[) , so the distribution of splitting types among the A 
orbits is the same as among all of the polynomials of degree d. Now, 
consider polynomials with constant term 1. How many of them are 
there in the A orbit of f(x)? It is easy to see that the number is equal 
to the number of solutions to 

f{b) = a d . 

If the curve C/ given by f(x)-y d is absolutely irreducible, that number 
isp+0( -\/p),byTheorem lZ2l By Theorem |10.3t in order for C/ to not be 
absolutely irreducible, we must either have that f(x) = g q (x), for some 
q\d, or f(x) = -4fo 4 (x), in case 4|rf. But the number of such polynomials 
is bounded by 0(p d/2 ), which is asymptotically neglible. So, we see 
that the distribution of splitting types amongst polynomials of degree 
d with constant term 1 is the same as for all polynomials, as long as 
d < p. 

11. Random walks on groups and graphs 

We consider the following situation: G is a finite undirected (multi)graph 
with n vertices, and T is a finite group. Each vertex V\ of G is decorated 
with an element U of T; we assume that the set {fi, ...,t n } generates T. 
We make the following assumptions: 

Condition 11.1. (1) The graph G is ergodic: the adjacency matrixA(G) 
of G is a Perron-Frobenius matrix (meaning that there is a 
unique eigenvalue of maximal modulus, and that some power 
A k (G) has all entries positive). 
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(2) We assume further that for every nontrivial one-dimensional 
unitary representation p of T, there exists 1 < i, j < n, such 
that p(ti) * p(tj). 

To each walk w = (v^v^, . . -,Vi k ) on G we associate the element 
y w = t ik . . . ti 2 ti v We now denote the set of all walks of length N by 
W N . 

Furthermore, we define a probability distribution P N on T, as follows: 
the probability density p N : T — > R+ is defined as p N (y) = \{iv e 

w N I y w = y}\/\W N \. 

A slightly more abstract way to think of this is as follows: Pn is the 
function on the group ring R[T], defined by: 

1 1V weW N 

Our main result is the following: 

Theorem 11.2. The distributions P N converge to the uniform distribution 
onT asN goes to infinity. Furthermore, there is a constant c = c(G, T, T) < 
1, such that p N (y)-l/\T\ < c N ,forally e T. Here, T refers to the assignment 
of generators of! to the vertices ofG. 

We will actually define a slightly stronger result: we will pick 
1 < i, j < n, and consider the set of all walks W N , ; of length N from 
the z'th to the ;th vertex of G. We define distributions P N/i) j in the 
obvious way Then: 

Theorem 11.3. Theorem \1 1.2\ holds with P N replaced by PN,i,j- 

The starting point for the proof of the theorems above is Fourier 
Transform on finite groups, which is discussed in Section [121 In par- 
ticular, we will be using Theorem 112.21 and Corollary 112.31 to reduce 
the question of whether a probability distribution is close to uniform 
to the proving that the Fourier Transform is small at every non-trivial 
representation. The reader might well wonder how moving the prob- 
lem to Fourier transform space helps us - the answer is that it turns 
out that we can reduce the estimation of the "fourier coefficients" to 
questions in linear algebra, through the construction in Section H3l 

12. Fourier Transform on finite groups 

For a thorough introduction to the topic of this section the reader 
is referred to H20T l2T| . Let T be a finite group, and let / : T — > C be 

a function on T. Furthermore, let T be the unitary dual of T : the set 
of all irreducible complex unitary representations of T. To / we can 
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associate its Fourier Transform f. This is a function which associates 
to each d-dimensional unitary representation p a d x d matrix f(p) as 
follows: 

/(p) = £/(y)p(y)- 

There is an inverse transformation, as well. Given a function g 

on T which associates to each rf-dimensional representation p a d x d 
matrix g(p), we can write: 

gKy) = ^J^d p tr{g{p)p{ Y -\ 

per 

where d p is the dimension of p. We mean "inverse" in the most direct 
way possible: 

/»=/. 

The following result is classical (see, eg, EH): 
Theorem 12.1. 

YA = n 

pinT 

and, together with the Fourier inversion formula, implies 
Theorem 12.2. Let g be a function on T, such that for every nontrivial 

\WMap < e, 

where |||»||| op denotes the operator norm (see Section [H]). Then, for any 
Yi, Yi e T, 

\g\n) - g\n)\ < 2e, 

Proof First, note that for the trivial representation p , the quantity 

d po g(po)po(y) = g(po), 
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so does not depend on y. By the Fourier inversion formula, then, 

\g\yi) - g\yi)\ = 



per 
P*Po 



< 



I 



1=1 



1 % ^ 



< 

byEq. @) 



per 
P*Po 

^^ 2 p lllg(p)lllop<2 e . 

per 



□ 



Corollary 12.3. Under the assumption ofTheorem \12.2[ and assuming in 
addition that g is real valued, if 

2>(y) = i, 

then 

g(y)-l/|r|<2e Vyer. 

Furthermore, ifdeT, 



(1) 



/eQ 



EG 
g(y) - iff 



< 2e|Q.| 



Proof. Without loss of generality, suppose that g(y) > 1/\T\. Then 
there is a yj, such that g(yi) < l/|r|. Thus, 

g(y)-l/\T\<g(y)-g(y 2 )<2e. 

The estimate ((D follows immediately by summing over Q. □ 

13. Fourier estimates via linear algebra 



In order to prove Theorem |11.3t we would like to use Theorem ll2.2l 
and to show the equidistribution result, we would need to show that 
for every nontrivial irreducible representation p, 



(2) 



lim 

N^oo \W N/ j,j 
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To demonstrate Eq. (O, suppose that p is /c-dimensional, so acts on 
a fc-dimensional Hilbert space H p = H. Let Z = L 2 (G) - the space of 
complex-valued functions from V(G) to C, let e\ , . . . , e n be the standard 
basis of Z, and let P, be the orthogonal projection on the z'-th coordinate 
space. We introduce the matrix 

(p(h) ... 



i=l 



p(f 2 ) ... 



... p(t n )) 

and also the matrix A p = A(G) <g> Ijj, where In is the identity operator 
on H. Both Up and A p act on Z (8) H. The following is immediate: 

Lemma 13.1. Consider the matrix (U p A p ) 1 , and think of it as an n x n 
matrix ofkxk blocks. Then the ij-th block equals the sum over all paths w 
of length I beginning at V{ and ending ofvj of p(y w ). 

Now, let Tji be the operator on Z which maps to S^e,-, 

Lemma 13.2. 

tr [((T^OsJh) (U p ®A p f(Pi®I H j\=tr p(y w ) 

Proof The argument of trace on the left hand side simply extracts the 
ij-th kxk block from (U p ® A p ) N . □ 

By submulticativity of operator norm, we see that 

WKT^P^lHiUp^ApfP^InWlp < |||(ir p ®A p ) N ||| op , 

and so proving Theorem 111.31 reduces (thanks to Theorem I12.2[) to 
showing 

Theorem 13.3. 

\\\(U p ®A p ) N \\\ op 
km — — : = 0, 

N^oo \WN,i,j\ 

for any non-trivial p. 

Notation 13.4. We will denote the spectral radius of an operator A by 
<R(A). 

Since |Wn//| x ^ n (A(G)), and by Gelfand's Theorem (Theorem 

MM, 

lim ||B N || 1/N = H(B), 

for any matrix B and any matrix norm || • ||, Theorem |13.3| is equivalent 
to the statement that the spectral radius of U p ® A p is smaller than 
that of A(G). 
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Theorem 113.31 is proved in Section [13. II 
13.1. Proof of Theorem H331 

Lemma 13.5. Let A be a bounded hermitian operator A : H — > H, and 
U : H — > H a unitary operator on the same Hilbert space H. Then the 
spectral radius of UA is smaller than the spectral radius of A, and the 
inequality is strict unless an eigenvector of A with maximal eigenvalue is 
also an eigenvector of U. 

Proof. The spectral radius of UA does not exceed the operator norm 
of UA, which is equal to the spectral radius of A. Suppose that the 
two are equal, so that there is a v, such that ||LL4z;|| = "R(A)v, and v is 
an eigenvector of UA. Since U is unitary, v must be an eigenvector 
of A, and since it is also an eigenvector of UA, it must also be an 
eigenvector of U. □ 

In the case of interest to us, p is a /c-dimensional irreducible rep- 
resentation of T, U = Diag(p(fi),. . .,p(t n ), while A = A(G) <g> I k . We 
assume that A(G) is an irreducible matrix, so that there is a unique 
eigenvalue of modulus < R(A(G)), that eigenvalue A max (the Perron- 
Frobenius eigenvalue) is positive, and it has a strictly positive eigen- 
vector v max . We know that the spectral radius of A equals the spec- 
tral radius of A(G), and the eigenspace of A max is the set of vec- 
tors of the form v max <g) w, where w is an arbitrary vector in C ,c . If 
u max = (xi,. . .,x n ), we can write u max <g> w = (xiw, . . .,x n w), and so 
U(v max <g> w) = (xip(ti)w, ... , x n p(t n )w). Since all of the Xj are nonzero, 
in order for the inequality in Lemma 113.51 to be nonstrict, we must 
have some w for which p(ti)w = cw (where the constant c does not 
depend on i.) Since the elements f, generate T, the existence of such a 
w contradicts the irreducibility of p, unless p is one dimensional. This 
proves Theorem 113.31 

14. Some remarks on matrix norms 

In this note we use a number of matrix norms, and it is useful 
to summarize what they are, and some basic relationships and in- 
equalities satisfied by them. For an extensive discussion the reader 
is referred to the classic |[T2|. All matrices are assumed square, and 
nxn. 

A basic tool in the inequalities below is the singular value decompo- 
sition of a matrix A. 

Definition 14.1. The singular values of A are the non-negative square 
roots of the eigenvalues of AA* , where A* is the conjugate transpose of A. 
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Since AA* is a positive semi-definite Hermitian matrix for any A, the 

singular values 0\ = f cr max > a 2 > . . . are non-negative real numbers. 
For a Hermitian A, the singular values are simply the absolute values 
of the eigenvalues of A. 

The first matrix norm is the Frobenius norm, denoted by ||«||. This is 
defined as 



||A|| = VtrA/F = JjV. 



This is also the sum of the square moduli of the elements of A. 
The next matrix norm is the operator norm, WHIlop/ defined as 



op = max||Ay|| = o Vl 
IMI=1 



Both the norms ||*|| and WHIlop are submultiplicative (submultiplica- 
tivity is part of the definition of matrix norm: saying that the norm 
|||*||| is submultiplicative means that \\\AB | < |||A||||||B|||.) 

From the singular value interpretation^ of the two matrix norms 
and the Cauchy-Schwartz inequality we see immediately that 

(3) ||A||/V^<ll|A||| op <||A|| 

We will also need the following simple inequalities: 
Lemma 14.2. Let Ube a unitary matrix: 

(4) |trAtr|<||A||V^<n|||A||| op . 



Proof. Since U is unitary ||LT|| = ||LT f || = ^fn. So, by the Cauchy- 
Schwartz inequality trAU < ||A||||ir|| = yfn\ \ U\\ .The second inequality 
follows from the inequality (0). n 



The final (and deepest result) we will have the opportunity to use 



is: 



Theorem 14.3 (Gelfand). For any operator M, the spectral radius ^(M) 
and any matrix norm |||«|||, 



<R(M) = lim|||M' 

k— >oo 



fe 1 1 1 1/fc 



A celebrated result of John von Neumann states that any unitarily invariant 
matrix norm is a symmetric guage on the space of singular values - [22\. 
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Appendix A. Symplectic matrices and David Kirby's Theorem 

Recall that the symplectic quadratic form in 2n dimensions is given 
by the matrix 



/ = 



n 



I 

-In 1' 



where I„ is the n x n identity matrix. A 2n x 2n matrix M is called 
symplectic if it preserves the symplectic form /, that is 

(5) M f /M = /. 

The invariance relation Eq. (0) can be written slightly more explicitly 
if we write M in n x n block form, as 



M 



A c B o 



In that case, Eq. (0 is equivalent to the following system: 

(6) A f C = C f A(= (A f C)'), 

(7) B'D = D t B(= (B'D) f ), 

(8) A*D - C*B = l n . 

This simple observation is all that is needed to prove the following 

Theorem A.l (David Kirby, Ifl3l ). Let ( R'be a commutative ring with 1, 
and let p(x) e be a reciprocal polynomial. Then, there is a symplectic 
matrix M with coefficients in %, such that 

det(xl 2 „ - M) = p(x). 

Proof sketch. In the notation of Eq. ((A), let A = 0, let detB = 1, and let 
C = — (B f ) _1 . Then, in order for 

M 

to be symplectic, it is necessary and sufficient that B l D be symmetric 
(since Eq. © and Eq. © hold automatically. 
Now, let 











(2 


9- 


{-(Br 1 


S) 



By = 



i + j <n 

1 i + j = n 

bj+j+i-n i + j> n, 
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and let D = E + F, where 




1 

-1 



\i - j\ > 1 or i = j > 1. 

|i-/l = l 
/ = / = 1, 



and 



F if = 



fo 



] < n 

otherwise 



for some \>i, . . . , b n , f\,...,f„ e "R. It is not hard to check that this is 
always so if n = 1, and when n > 1, it is necessary and sufficient that 



(9) 
(10) 



b 2 = fx + 1, 



i-\ 



b i =f i . 1 + Y J b j fi-j. 

7=2 



Now,ifp(x) = (l+x 2 ")+fl 1 *(x+x 2 "" 1 ) + . . a computation 

shows that a matrix M as above with /; = a\-\ - does the trick. We 
won't go through all the details, but the main idea in computing the 
determinant comes from the following: 



fxl -B \ B \ _ 
B" 1 xI-D hxl B" 1 ~ 



-I ) 

(x 2 + l)I-xD xB^-DB- 1 ,] 



so by taking determinants: 

det(xl - M) = det((x 2 + 1)1 - xD). 



□ 
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